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(54) Title: A METHOD FOR STORING AND RETRIEVING DATA AND A MEMORY ARRANGEMENT 

(57) Abstract 

The invention relates to a storage and retrieval 
method and to a memory arrangement. Storage is 
performed by proceeding on the basis of a search key in 
a directory structure having nodes at several different 
levels. An individual node may be (i) an internal node 
comprising a multidimensional array, or (ii) a leaf node 
containing a data unit or a pointer to a stored data 
unit. To minimize the storage space required, storage is 
performed by (a) selecting a predetermined dimension- 
specific number (Kj) of bits and forming therefrom a 
search word on the basis of which the address of the 
next node is sought from the internal node at the root 
level of the tree-shaped hierarchy, and proceeding to 
said node, (b) selecting from the unselected bits in the 
search key related to each dimension a predetermined 
dimension-specific number (kj) of bits and forming 
therefrom a search word with which the address of 
a further new node at a lower level is sought from the 
array of the node that has been accessed, (c) repeating 
step (b) until an empty element has been encountered 
or until the address of the new node at a lower level is 
the address of a leaf node, and (d) storing the data unit 
or a pointer referring to the data unit in the leaf node. 
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A method for storing and retrieving data and a memory 
arrangement 

The invention relates to methods in accordance 
5 with the preambles of the appended claims 1, 6, 18 and 

20 for storing . data , and to methods in accordance with 
the preambles of the appended claims 11, 15, 22 and 24 
for retrieving data. The invention also relates to 
associative memory arrangements in accordance with the 
10 preambles of the appended claims' 25 and 26. 

The approach in accordance with the invention is 
intended for use primarily in connection with central 
memory databases. A suitable application is the main- 
tenance of a subscriber database in a telephone ex- 
15 change. 

The prior art unidimensional directory structure 
termed digital trie (the word "trie" is derived from the 
English word "retrieval") is the underlying basis of the 
principle of the present invention. The digital trie is 
a tree-like structure composed of two types of nodes: 
leaf nodes containing a record, and internal nodes guid- 
ing the retrieval. An internal node is an array having a 
size of two by the power of k (2*) elements. If an ele- 
ment is in use, it refers either to an internal node at 
the next level in the directory tree or to a leaf node 
containing a record. In other cases, the element is free 
(empty):. Search in the database proceeds by examining 
the search key (which in the case of a subscriber data- 
base in a telephone exchange, for instance, is typically 
the binary numeral corresponding to the telephone number 
of the subscriber) k bits at a time. The bits to be 
searched are selected in such a way that at the root 
level of the structure (in the first internal node), k 
leftmost bits are searched; at the second level of the 
structure, k bits next to the leftmost bits are 
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searched, etc. The bits to be searched are interpreted 
as an unsigned binary integer that is employed directly 
to index the element array (the index indicates a given 
element in the array) . If the element indicated by the 

5 index is free , the search will terminate as unsuccess- 

ful . If the element refers to an internal node at the 
next level, k next bits extracted from the search key 
are searched at that level in the manner described 
above . As a result of comparison, the routine branches 

0 off either to an internal node at the next level or to a 

leaf node typically containing a key-pointer pair. I.f 
the element refers to a leaf node containing a record, 
the key scored therein is compared with the search key. 
The entire search key is thus compared only after the 

5 search has found a leaf node. Where zhe keys are equal, 

the search is successful, and the desired data unit is 
obtained at the storage address indicated by the pointer 
of the leaf node. Where the keys differ, the search 
terminates as unsuccessful. 

3 Figure 1 illustrates an example of a digital trie 

structure in which the key has a length of 4 bits and 
k=2 , and thus each internal node has 2 2 =4 elements, and 
two bits extracted from the key are searched at each 
level. Leaf nodes containing a record are denoted with 

3 references A, B, C, D...H...M, N, 0 and P. Thus a leaf 

node is a node that does not point to a lower level in 
the tree. The internal nodes are denoted with references 
VI... V5 and internal node elements with reference VA in 
Figure 1 . 

3 In the exemplary case shown in Figure 1, the 

search keys for the leaf nodes shown are as follows: 
A=0000, B=0001, C=0010, . . . , H = 0111, . . . and P=llll. In 
this case, a pointer is stored in each leaf node to that 
storage location in the database SD at which the actual 

3 data, e.g. the telephone number of the -pertinent sub- 
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scriber and other information relating to that sub- 
scriber, is to be found. The actual subscriber data may 
be stored in the database for instance as a sequential 
file of the type shown in the figure. The search is per- 

5 formed on the basis, of the search key of record H, for 

example, by first extracting from the search key the two 
leftmost bits (01) and interpreting them, which delivers 
the second element of node VI, containing a pointer to 
node V3 at the next level. At this level, the two next 

0 bits (11) are extracted from the search key, thus 

yielding the fourth element of that node, pointing to 
record H. 

Instead of a pointer, the leaf node may contain 
(besides a search key) an actual data file. Thus for 

5 example the data relating to subscriber A (Figure 1) may 

be located in leaf node A, the data relating to sub- 
scriber B in leaf node B, etc. Thus in the first em- 
bodiment (in which a key-pointer pair is stored in the 
leaf node) , the digital trie structure serves as a 

0 directory structure only, and in the second embodiment 

(in which the key and the actual data are stored in the 
leaf node) it serves as an associative memory. (In an 
associative memory, the storage locations are identified 
on the basis of their contents.) 

5 In the approach of the present invention, the 

search key is multidimensional in contrast to the above, 
that is, the search key comprises a number of attributes 
(for example the family name and one or more surnames of 
a subscriber) , and thus k i search key bits of an attri- 

0 bute are searched at a time in each attribute (i is an 

index corresponding to an attribute) . 

Previously, the unidimensional trie structure 
described above has been expanded into a multidimen- 
sional "EXCELL" structure (abbreviation for the words 

5 "extendible cell"), which is designed to support 
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searches relating to geometric applications for maximum 
efficiency. The EXCELL structure comprises cells parti- 
tioning the search space, and a separate directory part. 
The cell division is executed by recursive subdivision 
5 of the search space into two equal parts alternately in 

the direction of each axis of the search space. The 
result of the cell division is a multidimensional trie 
structure. The directory part is a grid in tabular form, 
wherein each grid element is equal to the smallest cell 

10 in the search space. The grid element points to a cell 

in the search space, and more than one grid elements may 
point to one cell. (The EXCELL structure is described in 
Tamminen, M . : The EXCELL method for efficient geometric 
access to data, Helsinki 1981, Acta Polytechnica 

15 Scandinavica , Mathematics and Computer Science Series 

#34, to which the interested reader is referred for a 
more comprehensive exposition.) 

Such a multidimensional trie structure, in which 
the cell size is dynamic (the directory part is doubled 

20 when data are entered into the structure), is attended 

by the problem of fast (exponential) growth of the 
directory part caused by certain kinds of (non-uniform) 
search key distribution. In consequence, the storage 
occupancy (memory circuit requirement) will increase in- 

25 ordinately and the efficiency (speed) of storage manage- 

ment will be impaired. 

: It is an object of the present invention to elim- 
inate the problems described above and to provide a 
memory in which the directory part can be contained in a 

30 smaller storage space than hitherto and which will be 

easier to manage than previously. This object is 
achieved with the storage and retrieval methods of the 
invention, the storage method being characterized in its 
basic form by (a) selecting from the search key related 

35 to each dimension a predetermined dimension-specific 
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number ( k i ) of bits and forming therefrom a search word 
on the basis of which the address of the next node is 
sought from an internal node at the root level of the 
tree-shaped hierarchy, and proceeding to said node, (b) 
5 selecting from the unselected bits in the search key 

related to each dimension a predetermined dimension- 
specific number (kj of bits and forming therefrom a 
search word with which the address of a further new node 
at a lower level is sought from the array of the node 

10 that has been accessed, (c) repeating step (b) until an 

empty element has been encountered or until the address 
of the new node at a lower level is the address of a 
leaf node, and (d) storing a pointer in the leaf node 
and said data unit at the storage location indicated by 

15 the pointer or alternatively storing said data unit in 

the leaf node. The retrieval methods in accordance with 
the invention are, in their basic form, characterized by 
that which is set forth in the characterizing portion of 
the appended claims 11 and 15. For the basic forms 

20 referred to above, a special case in which the directory 

structure comprises one node only may also be employed. 
The storage methods in accordance with this embodiment 
are characterized in that which is set forth in the 
characterizing portion of the appended claims 18 and 20, 

25 and the retrieval methods are in turn characterized in 

that which is set forth in the characterizing portion of 
the appended claims 22 and 24. The associative memory 
arrangements of the invention are in turn characterized 
in that which is set forth in the characterizing portion 

30 of the appended claims 25 and 26. 

The idea of the invention is to perform address 
computation in a multidimensional trie structure in such 
a way that a given predetermined number of bits is sel- 
ected from each dimension independently of the . other 

35 dimensions, a typical implementation being such that in 



WO 95/34155 



PCT/FI95/00319 



order to calculate one element array index, an unequal 
number of bits is selected from the search keys of the 
different: dimensions. Hence a fixed limit independent of 
the other dimensions is set for each dimension in an 
5 individual internal node in the trie structure, by pre- 

determining the number of search key bits to be searched 
in each dimension. Thus the selection in each dimension 
is independent of the selections in the other dimen- 
sions. This course of action avoids excessive growth of 

10 the suruciure in the lateral direction on the one hand 

(the size of the internal node has an upper limit), and 
excessive growth of the structure in depth in the 
direction cf any of the dimensions on the other hand 
(depth, i.e. the number of levels in each dimension, is 

15 equal to the total length of the search key in bits 

divided by the number of bits to be searched at a time, 
i.e., also the depth can be influenced by a parameter 
determining the size of the internal node) . Since the 
directory can thus be given specific limits in the 

20 lateral and depth directions in advance, storage require- 

ment will also be curbed. 

With the approach of the invention, the need for 
memory circuits in the hardware will be smaller than 
heretofore, and the storage management will be effect- 

25 ive, particularly when the distribution of the values of 

the search keys is known in advance (which is typically 
the case in a telephone exchange, for instance) . 

Another advantage of the invention is that- it is 
possible to design the structure (by selecting the para- 

30 meters) so as to favour, besides full key retrievals, 

also partial key retrievals * equitably in all directions. 
The method also permits retrievals in the directions of 
certain dimensions only, i.e. partial key retrievals, 
with maximum efficiency. These characteristics are of 

35 significance particularly in real-time databases requir- 
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ing high speed. A further advantage in terms of speed is 
that the approach enables efficient parallel processing 
(in other words, if for instance the search branches off 
into two subtrees, each can be processed with a separate 
5 processor) . 

In the following the invention and its preferred 
embodiments will be explained with reference to Figures 
2 to 6 by way of examples in accordance with the accom- 
panying drawings, in which 
10 Figure 1 illustrates the use of an unidimensional 

digital trie structure in the maintenance of subscriber 
data in a telephone exchange, 

Figure 2 illustrates a multidimensional trie 
structure , 

15 Figure 3 illustrates the structure of an element 

in an internal node in the trie structure in accordance 
with the invention, 

Figure 4 illustrates nodes in the directory struc- 
ture employed in the memory in accordance with the 

20 invention, 

Figure 5 illustrates ways in which an element in 
an internal node (an element array index) can be deter- 
mined, the internal nodes being in accordance with 
Figure 4, and 

25 Figure 6 illustrates a memory arrangement in 

accordance with the invention on block diagram level. 

: As stated previously, in the present invention 
the trie structure has a multidimensional implementation 
(generally n-dimensional ) . Such • a multidimensional 

30 structure is otherwise fully similar to the unidimen- 

sional structure described at the beginning, but the 
element array comprised in the internal node is multi- 
dimensional. Figure 2 exemplifies a two-dimensional 2 2 *2* 
structure, in which one dimension in the element array 

35 comprises four elements and the other dimension two ele- 
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ments. Leaf nodes pointed to from the elements in the 
internal node are indicated with circles in the figure. 

Address computation in the multidimensional case 
is performed on the same principle as in the unidimen- 
5 sional case. The fundamental difference, however, 

resides in that instead of one element array index, an 
index is calculated for each dimension in the element 
array (n indices) . Each dimension thus has a search key 
space of its own {0, 1,..., 2 v i-l} (v i is the length of 
10 the search key in bits in each dimension and 

ie { 1, . . .n) ) . 

In order to represent the configuration of an 
individual element VA in an internal node, it is first 
defined what is understood under the term overflow ele- 

15 ment . When a new pointer is added to the directory tree 

(as a result of data inserted into the memory) , it may 
be at such a location in the internal node that already 
has a pointer referring to a leaf node. Such a situation 
is termed a collision. There are a . wide variety of 

20 methods for resolving collision situations. In view of 

curbing memory requirement, it is no use creating a new 
internal node for the colliding pointer at the next 
level of the directory tree, but the pointer is located 
in the element of the colliding internal node if that 

2 5 node contains free elements . Such elements are called 

overflow elements, and they thus comprise pointers that 
are not located in accordance with the above straight- 
forward address computation mechanism. In order that 
such records may be retrieved, the elements in the 

30 internal node must incorporate a separate identifier 

indicating the type of the pointer contained in said 
element, in other words, whether it refers to an 
internal node at the next level or to an overflow 
element (in the same internal node) . 

35 Figure 3 exemplifies the structure of an element 
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in the internal node in the directory structure of the 
memory in accordance with the invention. The internal 
node may be of the type directory node or overflow node. 
The type of the internal node is directory node if it 
5 contains no overflow elements, and overflow node if it 

contains even one overflow element. An element in turn 
may be a directory element, an overflow element, or a 
free element (not in use) . A directory element can refer 
to a directory node, overflow node, or leaf node. An 

10 overflow element, on the other hand, can only refer to 

an overflow node or to a leaf 'node. 

Each element in this example has a length of 4 0 
bits (5 bytes), comprising the following fields (length 
of field in parentheses): pointer field 31 (32 bits), 

15 pointer type field 32 (2 bits), and element type field 

33 (2 bits) . Furthermore, one vacant field 34 is left in 
the element. The type of the pointer can be encoded with 
the two bits of the pointer type field 32 for example as 
follows : 

20 -00: pointer not in use, 

- 01 : pointer to a directory node, 

- 10: pointer to an overflow node, and 

- 11: pointer to a leaf node . 

The type of the element can be encoded with the 
25 two bits of the element type field for example as fol- 

lows : 



.- 00 

- 01 

- 10 

30 - 11 



free element , 
directory element , 
overflow element , and 
not in use. 

In the multidimensional trie structure of the 
memory in accordance with the present invention, the 
size of each internal node (number of elements) is pre- 
determined by fixing the parameter k l of each dimension 
35 (number of search key bits to be searched in the 
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dimension) in advance on the basis of the desired size 
of an individual internal node in the direction of that 
dimension, and on the other hand on the desired depth of 
the structure in the direction of that dimension. The 
5 size of the internal node in the direction of each 

dimension is thus 2 k ^ elements, and the total number S of 
elements in the internal node is also a power of two: 



S = II 2 -2 kl x2 k2 x2 k2 x . . . =2 N (1) 



10 All elements in an internal node having n dimen- 

sions can thus be pointed to with n integers (n^2), each 
of which can have a value between { 0, 1 . . . 2 : ' : i-l } . Thus the 
predetermined parameters include the total length of the 
search key in each dimension and the number of bits 

15 searched in each dimension. If for example one dimension 

of a search key has 256 attributes (e.g. first names) at 
the most, the total length of the search key is 8 bits. 
If the number of bits to be searched at one time is' sel- 
ected to be for instance 1, the structure has a depth of 

20 8/1 = 8 in the direction of this dimension. By suitably 

selecting the number of searched bits in each dimension, 
for example depth can be set to be equal in all dimen- 
sions . 

Hence in accordance with the invention the number 
25 of searched bits can differ in each dimension, and thus 

also the need for storage space can • be controlled 
individually for each dimension. 

Figure 4 shows an example of an internal node V10 
used in the directory structure of the memory in accord- 
30 ance with the invention, employing a three-dimensional 

search key. In the direction of the first dimension (x) , 
the internal node has 2 2 =4 elements, in the direction of 
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the second dimension (y) it has 2 l = 2 elements, and in the 
direction of the third dimension (2) it has 2^=8 ele- 
ments, which gives a total of 2 6 =64 elements numbered 
0 ... 63. 

5 Since the memory space in practical hardware 

implementations (for example computer equipment) is uni- 
dimensional, the multidimensional array is linearized, 
i.e. converted to be unidirectional, in the address com- 
putation operation (that is, in proceeding in the search 

10 tree). The linearization is an arithmetic operation that 

can be performed on arrays of all sizes. 

In linearization., the elements in the array are 
numbered starting from zero (as shown in Figure 4), the 
number of the last element being one less than the 

15 product of the sizes of all dimensions. The number of an 

element is the sum of the products of each coordinate 
(for example in the three-dimensional case, the x , y and 
z coordinates) and the sizes of the dimensions preced- 
ing it. The number thus computed is employed directly as 

20 an index for the unidimensional array. 

In the case of the internal node shown in Figure 
4, the element number VA n is calculated in accordance 
with the above with the formula: 

25 VA n = x + yx4 + z*4x2 (2) 



where xe{0,l,2,3}, ye{0,l} and 26(0,1,2,3,4,5,6,7}. Thus 
for example for element 54 we obtain from the coor- 
dinates thereof (2,1,6): 2+1*4+6x4x2=2+4+4 8=54. 

30 When the ( n-dimensioned ) element array of an 

internal node of an n-dimensional trie structure is lin- 
earized, in accordance with the above the size of each 
dimension is 2 k i, where k, is the number of bits to be 
searched at a time in the dimension concerned. If a co- 

35 . ordinate in accordance with the dimension is denoted by 



WO 95/34155 



PCT/FI95/00319 



12 

reference a, ( j e ( 0 , 1 , 2 , . . . n } ) , the linearization can be 
written out as 

J2 a j n 2 ki . V jiajE 10,1 2*J ! - l) A k Q = 0 (3) 

7 = 1 2=0 

5 The linearization can be carried out by perform- 

ing a multiplication in accordance with formula (3); yet 
it is expedient to, perform the linearization by forming 
from the search key bits a bit string, the corresponding 
numeral indicating the element whose content provides 

10 the basis for proceeding in the directory tree. In 

forming the bit string, the bits extracted from the 
search key of each dimension are set in succession, and 
the final bit string thus obtained is interpreted as a 
binary numeral. This gives the same result as formula 

15 (3). 

Figure 5 illustrates the linearization to be per- 
formed both by way of multiplication and by way of bit 
string formation in connection with the internal node 
shown in Figure 4. As indicated previously, the numbers 

20 of search key bits to be interpreted in each dimension 

have been selected as follows: ^=2, k 2 =l and k 3 =3 . The 
search keys in the direction of each dimension are as 
follows: a-^1001, a 2 =10, and a 3 =110001. In forming the bit 
string/ the two leftmost bits (10) of the search key a 1 

25 in the first dimension (the first dimension is the one 

in the direction of which the numbering of the elements 
is commenced, starting from zero) are first extracted 
and taken as the least significant bits of the bit 
string to be formed. This is denoted by line 51. Next, 

30 the leftmost bit is extracted from the search key of the 

second dimension (line 52), and this bit is appended on 
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the side of the most significant bits (left-hand side) 
into the bit string to be formed. Lastly, the three 
leftmost bits are extracted (110, line 53) from the 
search key of the third dimension, and these bits are 
5 inserted as the leftmost bits into the bit string. The 

resulting bit string (110110) is interpreted as a binary 
numeral, which will give access to element 54. The same 
element will be accessed also by way of multiplication 
in accordance with formula (3), but in that case the 

10 selected search key bits are interpreted first as a 

binary numeral, and coefficients for .formula (3) are 
formed from the number thus obtained as shown in Figure 
5.. On the basis of the pointer contained in element 54, 
the next internal node is accessed, wherein the same 

15 operation is repeated with the unused search key bits. 

(In the exemplary case, element 9= (001001) 2 is selected 
from this internal node.) Each element in the internal 
node may refer either to another internal node (as 
element 3 to internal node Vll) or to a leaf node 

20 containing a record (as element 43 to record R) . In 

other cases, the element is not in use (is empty). 

With bit string formation the multiplication 
operations can be converted to addition operations and 
bit shifts (or to additions and bit-related OR oper- 

25 ations) . The mutual multiplication operations on numbers 

2 k i are replaced by addition operations on exponents k ± . 
The remaining multiplication of coordinate by a power 
of two will be converted to a bit shift operation by bit 
string formation (a multiplication by 2 k is known to cor- 

30 respond to a bit shift of k bits to the left) . As multi- 

plication will always take at least 13 clock cycles 
(with an Intel i4 86 processor, for instance), a bit 
shift can be performed in three clock cycles when the 
operands are stored in the processor register, in four 

35 clock cycles when the operands are to be found in the 
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cache of the processor, and in ten clock cycles even in 
the case that the operands are to be retrieved from 
memory. Addition, on the other hand, will only take one 
clock cycle in the registers and otherwise 2-3 or 8-9 

5 clock cycles depending on whether the operands are to be 

found in the cache of the processor. 

The above-described method by which a bit string 
is formed is advantageous in that it is order-preserving 
(in other words, the records are organized in the struc- 

0 ture in the order of the values of their search keys). 

Th-is is due- to the fact that k i most "significant (left- 
most) bits are extracted from the search key at a time. 
However, in principle it is possible to form the bit 
string also in such a way that k ; least significant 

5 (extreme right) bits are extracted from the search key. 

The bit string may also be formed by treating the 
dimensions in a different order than above, yet the 
above method in which the bit string to be formed is 
increased on the left is the simplest, because in that 

0 case there is no need to know how many positions are to 

be assigned for the bits of each dimension in the bit 
string . 

The multiplication in accordance with formula (3) 
may be employed for linearization (i.e., calculating an 

5 element array index) irrespective of the size of each 

dimension. On the other hand, bit string formation can 
only b.e employed where the size of the internal node 
(number of elements), in the direction of each dimension 
.is a power of two. Since the internal node of the 

0 directory structure in accordance with the present 

invention meets this requirement, it is advantageous to 
employ the bit string-formation method in the retrieval 
and storage methods in accordance with the invention, 
which is a more efficient (rapid) method than the multi- 

5 plication in accordance with formula (3) . The rapidity 
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is due to the fact that bit string formation will 
convert multiplications to additions and bit shifts, 
which are faster to perform. 

A directory structure as set forth above may be 
5 varied for example in such a way that a leaf node in the 

structure comprises more than one key-pointer pair. The 
difference then resides in that several search key com- 
parisons instead of one comparison are performed in the 
leaf node. 

0 Instead of a pointer, the record contained in the 

leaf node may comprise (besides the search key which is 
not indispensable) actual data. In this embodiment, the 
trie structure thus serves as an associative memory. 

In principle, storage maintenance includes three 

5 different operations: retrieval, addition (storage), and 

deletion. When a record is to be added to actual data 
(subscriber data at a telephone exchange, for instance), 
the addition is made either directly to a leaf node or 
elsewhere in the memory. In the latter case, a pointer 

d to the storage address at which the data record is 

stored is stored in the leaf node. 

Hence the search proceeds in such a way that a 
predetermined dimension-specific number of bits (k A ) is 
selected from the search key related to each dimension, 

5 and a search word is formed from the bits in the above 

manner, on the basis of which the element array index of 
the internal node at the root level is found. The ele- 
ment corresponding to the index will deliver the address 
of the next node, which is thus accessed. In the next 

3 node, a predetermined dimension-specific number of bits 

( k i ) is selected from among the unselected bits in the 
search key related to each dimension, and a search word 
is formed from these bits, employed for finding an 
internal node element. The selection of search key bits 

5 is continued until all bits in the search key have been 
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selected or the address of a node at the next lower 
level is the address of a leaf node. Thereafter, either 
(a) the pointer contained in the leaf node is read from 
the leaf node and the record is read from the location 
5 indicated by the pointer, or (b) the one or more records 

contained in the leaf node are read. If an empty element 
is encountered in proceeding in the directory structure, 
the search is terminated as unsuccessful (in other 
words, it is known that no data associated with the rel- 
10 evant key is stored in the memory) . 

Hence in accordance with the invention a predet- 
ermined dimension-specific number of bits (kj is 
selected from the unselected bits of the search key rel- 
ated to each dimension. In each dimension, the number to 
15 be selected is independent of the number selected in the 

other dimensions, and the number to be selected remains 
constant in the same dimension. 

When data are inserted, the search proceeds in 
the directory structure on the basis of a search key in 
a similar manner until an empty element is encountered 
or until the address of a new node at a lower level is 
the address of a leaf node. 

In the former case, a leaf node is created and 
the free element is set to point to this leaf node. 
Thereafter, the data are inserted either directly into 
the leaf node or elsewhere in the memory; in the last- 
mentioned case, a pointer to the storage address at 
which the added record was stored is stored in the leaf 
node . 

In the latter case, a leaf node already existing, 
the situation depends on whether the leaf node contains 
any space for an insertion (in other words, the 
situation depends on the size of the leaf node) . If for 
example only one record will fit into the leaf node (or 
the leaf node is otherwise full), the depth of the 
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directory structure must be increased. In that case, the 
element at the point of collision is set to refer to a 
new node in which element array indices are calculated 
for the keys. Adding of a new node is repeated if the 
5 indices are still equal. Thus the addition of a new node 

is repeated so many times that the indices are unequal. 
This is accomplished by means of identifying search keys 
at the latest when all bits in the search key are 
employed, i.e. when the depth of the structure has 
10 reached its maximum. If there is space in the leaf node, 

the pointer or the actual data record is stored in the 
overflow element of the leaf node. If a pointer is 
stored in the overflow element, it will again point to 
the storage address at which the actual data record is 
15 stored. The overflow possibility may only be realized in 

a leaf node of the directory structure. 

Deletion of data is performed in a manner known 
per se by finding, on the basis of the search key, the 
node containing a reference to a pointer or record to be 
20 deleted, as described above. If possible, the node is 

freed in order that storage space occupancy may be main- 
tained at a minimum all the time. Freeing is possible 
for example where the node contains one pointer to a 
leaf node and no pointer to a node at a lower level. 
25 The collision-prevention strategy to be chosen in 

^ each particular case usually has an effect not only on 

the insertion operations but also -on how the retrieval 
and deletion operations are to be performed. Thus the 
details of the implementation of these operations can 
30 vary, even though in all of them progress is made by the 

address computation method of the present invention. 
Since, however, such a variation is not relevant to the 
actual inventive idea and since the variation will take 
forms known to those skilled in the art, it will not be 
35 more closely explained in this context. 
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Figure 6 shows a memory in accordance with the 
invention on block diagram level. Each dimension has a 
dedicated input register, and hence there are n input 
registers. The search key of each dimension is stored in 

5 these input registers, each key in a register of its 

own. The input registers are connected to register TR in 
which the above-described search word is formed. The 
register TR is connected via adder 61 to the address 
input of memory MEM. The output of the memory in turn is 

0 connected to address register AR the output of which in 

turn is connected to adder 61. Initially the bits 
selected from each register are read into the common 
register TR in the correct order. The initial address of 
the first internal node is stored in the address 

5 register AR, and the address obtained as an offset 

address from register TR is added to the initial address 
in adder 61. The resulting address is supplied to the 
address input of the memory MEM, and the data output of 
the memory provides the initial address of the next in- 

D ternal node, the address being written into the address 

register AR over the previous address stored therein. 
Thereafter the next selected bits are again loaded from 
the input registers into the common register TR in the 
correct order, and the array address thus obtained is 

5 added to the initial address of the relevant array 

(i.e., internal node), obtained from the address regis- 
ter AR . . This address is supplied to the address input of 
the memory MEM, the data output of the memory thereafter 
providing the initial address of the next internal node. 

D The above-described procedure is repeated until- the 

desired position has been accessed and recordal can be 
performed or the desired register read. 

The rapidity of the method of the invention can 
be influenced by the type of hardware configuration 

5 chosen. Since progress is by way of the above-stated bit 
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manipulations, by selecting a given dimension-specific 
number of bits at a time from the search key of each 
dimension, address computation can be sped up by 
shifting from use of one processor to a multiprocessor 

5 environment in which parallel processing is carried out. 

An alternative to the multiprocessor environment is an 
ASIC circuit. 

In the above, the structure of the memory has 
been described in the form that it will typically assume 

0 in implementation. In practice, nonetheless, one leaf 

node is first created from all .nodes, and only after 
that first leaf node has overflown is the first internal 
node created, and the above-described directory struc- 
ture will start taking shape. Thus in principle it is 

5 even possible to realize address computation of the 

above-described kind in one very large leaf node (that 
will never overflow) that comprises a multidimensional 
array, one element in the array thus containing - if it 
is in use - a data unit or a pointer to a data unit. 

0 Hence in this case only the first k,^ bits are selected 

from the search key of each dimension. If the element 
found by means of these bits is empty, either an actual 
data record or a pointer to that storage address at 
which the data record is stored is stored in the 

5 element. If a collision situation arises in the storage, 

it will be resolved by methods known per se. Such a one- 
leaf embodiment may in practice be feasible if the 
distribution of stored data is appropriate. The effici- 
ency of storage space utilization, however, is- inferior 

0 to that of the basic embodiment disclosed above, which 

has a multidimensional directory structure. 

The one-leaf additional embodiment described 
above is, in a sense, a special case of a possible 
embodiment which also has a multidimensional directory 

5 structure, but along with that, address computation is 
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continued in the leaf node as well, as has been 
described previously (provided that unsearched bits 
still exist) . In that case, a leaf node has a structure 
similar to that of an internal node. 
5 Even though the invention has been explained in 

the above with reference to examples in accordance with 
the accompanying drawings, it is obvious that it is not 
to be so restricted, but it can be modified within the 
scope of the inventive idea disclosed above and in the 
10 appended claims. 
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Claims : 

1. A method for storing data identifiable by a 
search, key in memory, the data being stored as data 
5 units in a dedicated storage space assigned for each 

data unit, wherein storage is performed by proceeding, 
on the basis of a search key associated with the data 
unit to be stored, in a directory structure comprising a 
tree-shaped hierarchy having nodes at several different 

10 levels, wherein an individual node can be 

- an internal node comprising a multidimensional 
array wherein an individual element may contain the 
address of a lower node in the tree-shaped hierarchy and 
wherein an individual element may also be empty, or 

15 - a leaf node containing at least one pointer to 

a stored data unit, characterized by the 
following steps : 

(a) selecting from the search key related to each 
dimension a predetermined ' dimension-specif ic number (kj 

20 of bits and forming therefrom a search word on the basis 

of which the address of the next node is sought from the 
internal node at the root level of the tree-shaped hier- 
archy, and proceeding to said node, 

(b) selecting from the unselected bits in the 
25 search key related to each dimension a predetermined 

dimension-specific number (k A ) of bits and forming there- 
from a : search word with which the address of a further 
new node at a lower level is sought from the array of 
the node that has been accessed, 

30 (c) repeating step (b) until an empty element has 

been encountered or until the address of the new node at 
a lower level is the address of a leaf node, 

(d) storing a pointer in the leaf node and said 
data unit at the storage location indicated by the 

35 pointer. 



WO 95/34155 



PCT/FI95/00319 



2. A method as claimed in claim 1, charac- 
terized in that a new leaf node wherein storage 
is performed is created after step (c) in a known 
manner, if necessary. 
5 3. A method as claimed in claim 1, charac- 

terized in that said dimension-specific number 
(k:) is selected in such a way that the structure has 
equal depth in the direction of all dimensions and hence 
will favour partial key retrievals equitably in all 
10 directions. 

4. A method as claimed in claim 1, charac- 
terized in that said dimension-specific number 
(k<) is selected in such a way that the structure has 
greater depth in the direction of a given dimension/ 

15 given dimensions and hence will favour partial key re- 

trievals in that direction/those directions. 

5. A method as claimed in claim 1, charac- 
terized in that the search word is formed by set- 
ting the bits selected from the search key of each 

20 dimension in succession and interpreting the bit string 

thus obtained as a number indicating in the array of the 
internal node the element wherefrom the address of a 
lower node is sought. 

6. A method for storing data identifiable by a 
25 search key in memory, the data being stored as data 

units in a dedicated storage space assigned for each 
data unit, wherein storage is performed by proceeding, 
on the basis of a search key associated with the data 
unit to be stored, in a directory structure comprising a 

30 tree-shaped hierarchy having nodes at several different 

levels, wherein an individual node can be 

- an internal node comprising a multidimensional 
array wherein an individual element may contain the 
address of a lower node in the tree-shaped hierarchy and 

35 wherein an individual element may also be empty, or 
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- a leaf node in which at least one data unit is 
stored, characterized by the following 
steps: 

(a) selecting from the search key related to each 
5 dimension a predetermined dimension-specific number (kj 

of bits and forming therefrom a search word on the basis 
of which the address of the next node is sought from the 
internal node at the root level of the tree-shaped hier- 
archy, and proceeding to said node, 
0 (b) selecting from the unselected bits in the 

search key related to each dimension a predetermined 
dimension-specific number (k,) of bits and forming there- 
from a search word with which the address of a further 
new node at a lower level is sought from the array of 
5 the node that has been accessed, 

(c) repeating step (b) until an empty element has 
been encountered or until the address of the new node at 
a lower level is the address of a leaf node, 

(d) storing said data unit in the leaf node. 

0 7. A method as claimed in claim 6, charac- 

terized in that a new leaf node wherein storage 
is performed is created after step . (c) in a known 
manner, if necessary. 

8. A method as claimed in claim 6, charac- 

5 terized in that said dimension-specific number 

(kj is selected in such a way that the structure has 
equal d ; epth in the direction of all dimensions and hence 
will favour partial key retrievals equitably in all 
directions . 

0 9. A method as claimed in claim 6, charac- 

terized in that said dimension-specific number 
(k d ) is selected in such a way that the structure has 
greater depth in the direction of a given dimension/ 
given dimensions and hence will favour partial key re- 

5 trievals in that direction/those directions. 
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1 0 . A method as claimed in claim 6 , char- 
acterized in that the search word is formed by 
setting the bits selected from the search key of each 
dimension in succession and interpreting the bit string 

5 thus obtained as a number indicating in the array of the 

internal node the element wherefrom the address of a 
lower node is sought. 

11. A method for retrieving data identifiable by 
a search key from memory, the data being stored as data 

0 units in a dedicated storage space assigned for each 

data unit, wherein retrieval is performed by proceeding 
on the basis of a search key in a directory structure 
comprising a tree-shaped hierarchy having nodes at 
several different levels, wherein an individual node can 

5 be 

- an internal node comprising a multidimensional 
array wherein an individual element may contain the 
address of a lower node in the tree-shaped hierarchy and 
wherein an individual element may also be empty, or 
3 - a leaf node containing at least one pointer to 

a stored data unit, characterized by the 
following steps: 

(a) selecting from the search key related to each 
dimension a predetermined dimension-specific number (k A ) 

5 of bits and forming therefrom a search word on the basis 

of which the address of the next node is sought from the 
internal node at the root level of the tree-shaped hier- 
archy, and proceeding to said node, 

(b) selecting from the unselected bits in the 
3 search key related to each dimension a predetermined 

dimension-specific number (k i ) of bits and forming there- 
from a search word with which the address of a further 
new node at a lower level is sought from the array of 
the node that has been accessed, 
5 (c) repeating step (b) until an empty element has 
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been encountered, until the address of the new node at a 
lower level is the address of a leaf node or until all 
bits of the search key have been selected, and 

if an empty element was not encountered in step 

5 (c) , 

(d) reading from the leaf node the pointer con- 
tained therein and reading the data unit at the location 
indicated by the pointer. 

12. A method as claimed in claim 11, char- 
10 acterized in that said dimension-specific num- 

ber (k,) is selected in such a way that the structure has 
equal depth in the direction of all dimensions and hence 
will favour partial key retrievals equitably in all 
directions . 

15 13. A method as claimed in claim 11, char- 

acterized in that said dimension-specific num- 
ber (kj is selected in such a way that the structure has 
greater depth in the direction of a given dimension/ 
given dimensions and hence will favour partial key re- 
trieval in that direction/those directions. 

14. A method as claimed in claim 11, char- 
acterized in that the search word is formed by 
setting the bits selected from the search key of each 
dimension in succession and interpreting the bit string 

25 thus obtained as a number indicating in the array of the 

internal node the element wherefrom the address of a 
lower node is sought. 

15. A method for retrieving data identifiable by 
a search key from memory, the data being stored as data 

30 units in a dedicated storage space assigned for each 

data unit, wherein retrieval is performed by proceeding 
on the basis of a search key in a directory structure 
comprising a tree-shaped hierarchy having nodes at 
several different levels, wherein an individual node can 

35 be 
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- an internal node comprising a multidimensional 
array wherein an individual element may contain the 
address of a lower node in the tree-shaped hierarchy and 
wherein an individual element may also be empty, or 
5 _ a leaf node in which at least one data unit is 

stored, characterized by the following 
steps : 

(a) selecting from the search key related to each 
dimension a predetermined dimension-specific number (k t ) 

10 of bins and forming therefrom a search word on the basis 

of which the address of the next node is sought from the 
internal node at the root level of the tree-shaped hier- 
archy, and proceeding to said node, 

(b) selecting from the uhselected bits in the 
15 search key related to each dimension a predetermined 

dimension-specific number (k,) of bits and forming there- 
from a search word with which the address of a further 
new node at a lower level is sought from the array of 
the node that has been accessed, 
20 (c) repeating step (b) until an empty element has 

been encountered, until the address of the new node at a 
lower level is the address of a leaf node or until all 
bits of the search key have been selected, and 

if an empty element was not encountered in step 

25 (c), 

(d) reading from the leaf node one or more data 
units contained therein . 

16. A method as claimed in claim 15, char- 
acterized in that said dimension-specific num- 

30 ber (k.) is selected in such a way that the structure has 

equal depth in the direction of all dimensions and hence 
will favour partial key retrievals equitably in all 
directions. 

17. A method as claimed in claim 15, char- 
35 acterized in that said dimension-specific num- 
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ber (kj is selected m such a way that the structure has 
greater depth in the direction of a given dimension/ 
given dimensions and hence will favour partial key re- 
trievals in that direction/those directions. 

18. A method for storing data identifiable by a 
search key in memory, the data being stored as data 
units in a dedicated storage space assigned for each 
data unit, wherein storage is performed by proceeding, 
on the basis of a search key associated with the data 
unit to be stored, in a directory structure having one 
■ node comprising a multidimensional array wherein an 
individual element may contain a pointer to a data unit 
or wherein an individual element may also be empty, 
characterized by 
15 selecting from the search key related to each 

dimension a predetermined dimension-specific number (k,) 
of bits and forming therefrom a search word on the basis 
of which an element is sought from the array of the 
node, and if said element is empty, storing a pointer in 
the element and said data unit at the storage location 
indicated by the pointer. 

19. A method as claimed in claim 18, c h a r - 
acterized'in that the search word is formed by 
setting the bits selected from the search key of each 

25 dimension in succession and interpreting the bit string 

thus obtained as a number indicating said element. 

20. A method for storing data identifiable by a 
search key in memory, the data being stored as data 
units in a dedicated storage space assigned for each 
data unit, wherein storage is performed by proceeding, 
on the basis of a search key associated with the data 
unit to be stored, in a directory structure having one 
node comprising a multidimensional array wherein an 
individual element may contain a data unit or wherein an 

35 individual element may also be empty, - c h a r a c - 
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selecting from the search key related to each 
dimension a predetermined dimension-specific number (kj) 
of bits and forming therefrom a search word on the basis 
5 of which an element is sought from the array of the 

node, and if said element is empty, storing the data 
unit in said element. 

21. A method as claimed in claim 20, char- 
acterized in that the search word is formed by 

10 setting the bits selected from the search key of each 

dimension in succession and interpreting the bit string 
thus obtained as a number indicating said element. 

22. A method for retrieving data identifiable by 
a search key from memory, the data being stored as data 

15 units in a dedicated storage space assigned for each 

data unit, wherein retrieval is performed by proceeding, 
on the basis of a search key associated with the data 
unit to be stored, in a directory structure having one 
node comprising a multidimensional array wherein an 

20 individual node may contain a pointer to said data unit 

or wherein an individual node may also be empty, 
characterized by 

selecting from the search key related to each 
dimension a predetermined dimension-specific (kj number 

25 of bits and forming therefrom a search word on the basis 

of which an element is sought from the array of the 
node, and if said element is empty, storing a pointer in 
said element and the data unit at the storage location 
indicated by the pointer. 

30 23. A method as claimed in claim 22, char- 

acterized in that the search word is formed by 
setting the bits selected from the search key of each 
dimension in succession and interpreting the bit string 
thus obtained as a number indicating said element. 

35 2 4. A method for retrieving data - identifiable by 
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a search key from memory, the data being stored as data 
units in a dedicated storage space assigned for each 
data unit, wherein retrieval is performed by proceeding, 
on the basis of a search key associated with the data 
unit to be stored, in a directory structure having one 
node comprising a multidimensional array wherein an 
individual element may contain a data unit or wherein an 
individual element may also be empty, 
characterized by 

selecting from the search key related to each 
dimension a predetermined dimension-specific number (k i ) 
of bits and forming therefrom a search word on the basis 
of which an element is sought from the array of the 
node, and if said element is empty, storing the data 
15 unit in said element. 

25. An associative memory arrangement for storing 
data in electrical form, the data being stored as data 
units in a dedicated storage space assigned for each 
data unit in the memory, the memory arrangement compris- 
20 ing a directory structure which is a multidimensional 

digital trie structure comprising a tree-shaped hier- 
archy having nodes at several different levels, wherein 
a node can be 

- an internal node comprising a multidimensional 
25 array wherein an individual non-empty element comprises 

a pointer (31) guiding progress in the directory struc- 
ture, or 

- a leaf node containing one or more pointers to 
a data unit stored in the memory, 

30 each data unit to be stored in the memory being 

associated with a multidimensional search key comprising 
a search key for each dimension, on the basis of which 
search keys progress is made in the directory structure, 
the directory structure returning the data unit associ- 

35 ated with the multidimensional search key employed . in 
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each case, characterized in that the size 
of an individual multidimensional internal node in the 
directory structure has been fixed by predetermining the 
number of search key bits k i to be searched in each 
5 dimension independently of the other dimensions, the 

size of the internal node in each dimension being 2 k *. 

26. An associative memory arrangement for storing 
data in electrical form, the data being stored as data 
units in a dedicated storage space assigned for each 
10 data unit in the memory, the memory arrangement compris- 

ing a directory structure which 'is a multidimensional 
digital trie structure comprising a tree-shaped hier- 
archy having nodes at several different levels, wherein 
a node can be 

15 - an internal node comprising a multidimensional 

array wherein an individual non-empty element comprises 
a pointer ( 31 ) guiding progress in the directory struc- 
ture, or 

- a leaf node containing at least one data unit, 
20 each data unit to be stored in the memory being 

associated with a multidimensional search key comprising 
a search key for each dimension, on the basis of which 
search keys progress is made in the directory structure, 
the directory structure returning the data unit associ- 
25 ated with the multidimensional search key employed in 

each case, characterized in that the - size 
of an individual multidimensional internal node in the 
directory structure has been fixed by predetermining the 
number of search key bits ki to be searched in each 
30 dimension independently of the other dimensions, the 

size of the internal node in each dimension being 2 k i. 
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